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Abstract. Group-based models arise in algebraic statistics while studying evolution pro- 
cesses. They arc represented by embedded algebraic varieties, that turn out to be toric. 
Both from the theoretical and applied point of view one is interested in determining the 
ideals defining the varieties. Conjectural bounds on the degree in which these ideals are 
^ . generated were given by Sturmfels and Sullivant |SS051 Conjectures 29, 30]. We prove that 

for the 3-Kimura model, corresponding to the group G = Z2 x Z2 , the projective scheme can 
be defined by an ideal generated in degree 4. In particular, it is enough to consider degree 
4 phylogcnctic invariants to test if a given point belongs to the variety. Moreover, we give a 
constructive proof, that for any fixed, finite abclian group G there exists a constant d, such 
that for any tree the associated projective scheme can be defined by an ideal generated in 
degree at most d. 

1. Introduction 

We investigate properties of a special class of not necessarily normal, projective toric 
varieties. Their construction is motivated by applied mathematics, namely phylogenetics. 
We will not describe the connections with applications, referring to |SS03j . |PS05j . On the 
other hand we give an exact mathematical definition of the class of varieties we consider. 
We present the problems in toric geometry, arising from applied mathematics, so that pure 
mathematicians can address them. 

Let T be a rooted tree, that is a connected graph without cycles and with one distinguished 
vertex Vq called the root. We direct the edges of T away from the root. Let V be the set 
of vertices of T. For each v G V, consider a vector space W v with a fixed basis e\, . . . , e v , v y 
We will construct subvarieties in <S> veL W v , where L is the set of leaves of T. The variety is 
given as a closure of the parametrization map 

<t> ■ w vo x J] w* vi w V2 -+(g)W v , 

where the product Yl( Vl v 2 ) * s taken over all edges of T. Consider the basis <S> v€V ej/ ^ of the 
vector space <S> v eL W v , indexed by functions j that associate to a vertex v an index from 1 
to n{y). We define by: 

veV ("1,1*2) i (vi,V2) 

where w v * iV2 G W* x (8> W V3 and the sum £\ is taken over all functions that associate to a 
vertex v an index of a basis vector of W v and agree with j, when v is a leaf. As usually by 
* we denote the dual vector, with respect to the chosen basis. 
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The image of (j) is called the general Markov model. Let K\ n be the claw tree - cf. 
Definition I3.2[ with the inner vertex vq. We encourage the reader to check that for this 
tree and W VQ one dimensional, the obtained variety is the Segre embedding of V\, . . . , V n . 
Moreover, when W vo is k dimensional, the Zariski closure of the image is the k-th secant 
variety of the Segre variety. More details on this construction, including motivation, can be 
found in |PS05t Chapter 4] or a short paper [ERSS04j . 

We are interested in equivariant versions of the above construction. First examples of such 
appeared in phylogenetics, the science that aims at reconstructing the evolution of DNA 
|Kim80j . |Kim81j . |JC69j . It was observed that certain symmetries between nucleobasis of 
DNA can be induced by an action of a group. Important mathematical implications of such 



a description of the variety were proven in |HP89)H . Along with other works, [ES93| ISSE93] 



they proved that, using modern language, if the group is sufficiently big, the obtained variety 
is toric - we do not assume that toric varieties have to be normal. Such models are called 
group-based and are the central object of study of this article. 

In this setting we assume that all vector spaces W v = W are regular representations of 
a fixed finite abelian group G. Thus we can identify the basis with group elements. We 
restrict the domain of to G-invariant vectors. More precisely we consider: 

0g : W G x Y[ (W* <g> Wf -> (g) W. 

(v\,V2) VdL 

where the action of the group on the tensor product and on the dual is induced from the 
vector space. Note that W is one dimensional, hence the image coincides with the image 
°f Y\{ Vl v 2 )(y^* ® W) G . The fact that W G is one dimensional corresponds in biology to the 
assumption of uniform root distribution. The Zariski closure of the image will be denoted 
by X(T,G) and is known to be toric. We also let X-p(T,G) be the projective variety, such 
that X(T, G) is the affine cone over X-p(T, G). 

More generally, one can consider the vector spaces W v to be arbitrary representations. The 
model in such a case is called equivariant - it was introduced in |DK09| . This construction 
does not always lead to a toric variety. Indeed, the first presented example - secants of Segre 
- is an equivariant model with the trivial group. 

The study of equivariant models was reduced to the case of claw trees. Still, the description 
of the varieties associated to claw trees is difficult. Let us present what is known. 

(1) The equivariant model associated to the trivial group corresponds to secant variety 
of the Segre embedding. These varieties are an object of intensive studies. The de- 
scription of the first secant was recently given in |Raillj . solving the GSS conjecture. 
It is known that analogous description, by flattenings, for higher secants does not 
hold. On the other hand, it was proven that, for a fixed k, there exists d, such that 
the k-th secant of the Segre variety, for any number of factors, is set theoretically 
described by equations of degree at most d, |DKllj . 

(2) The group-based model with G = Z 2 is the only model where we can explicitly give a 
description of the ideal |SS05l Theorem 28], |CP07j . It is definitely the simplest model 
one can consider. However, it turned out to be very interesting due to nice properties 
|BW07j and connections with different branches of mathematics [SX101 IMan09j . For 
the generalizations of the construction to graphs we advise the reader to consult 
[BBKMlOj . 



1 We would like to thank Elizabeth Allman for brigning this fact to our attention. 
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(3) Equivariant models for an abelian group G include previous examples. Recently, it 
was proved in |DE] that set theoretically they can be defined in (some) bounded 
degree. 

The following conjecture is due to Sturmfels and Sullivant. 

Conjecture 1.1 (Conjecture 30). The ideal for the 3-Kimura model, that is a group-based 
model for G = Z 2 x Z 2; is generated in degree at most 4 for any tree T. 

One is particulary interested in this model, as it was introduced by theoretical biologists. 

Main Theorem 1. For any tree T the projective scheme X-p(T, Z 2 x Z 2 ) can be defined by 
an ideal generated in degree at most 4. 

Conjecture 11.11 has the following generalization, also due to Sturmfels and Sullivant. 

Conjecture 1.2 (Conjecture 29). For any group G the ideal of the group-based model is 
generated in degree at most \G\ for any tree T. 

Main Theorem 2. For any group G there exists a constant d such that the projective scheme 
Xp(T, G) can be defined by an ideal generated in degree at most d for any tree T . 

The result presented above is quite similar to the main results of |DEj . that generalize 
[DK09j . Let us state 3 major differences: 

(1) The results in |DE] concern a wide class of abelian equivariant models, while ours 
deal only with group-based models; 

(2) Our results are scheme theoretic, while the results in [DEj are set theoretic] 

(3) Our results are constructive, while methods of |DEj are existential. 

In [DEj the authors use a beautiful technique that basis on noetherian arguments in rings, 
that are not finitely generated, but are equipped with an additional monoid action. For more 
on this technique see [HS091 IHdC09| IDralOj . Our methods are typical for toric geometry. 
Indeed, we believe that the presented approach can be used for much broader class of prob- 
lems in toric geometry to determine the degree of generation - see for example [Brullj . In 
particular the methods should cover group-based models for small groups G. As our proves 
are constructive, for group-based models, one can explicitly write a polynomial algorithm 
for testing if a point belongs to X(T,G). The existence of such an algorithm was proven 
in |DEj . Still, the constants in general would be to big for the algorithm to be of practical 
value. However, for the 3-Kimura model, our results are sharp. The elements of the ideal 
defining X(T, G) are called phylogenetic invariants. They are sought by people dealing with 
phylogenetics. 

Corollary 1.3. For any tree T any point P belongs to X(T, Z 2 x Z 2 ) if and only if all 
phylogenetic invariants of degree at most four vanish at P. 

The methods, how to inductively find phylogenetic invariants of a fixed degree were pre- 
sented in jDBMl Section 3.2]. We believe that Corollary O may be used in applied mathe- 
matics. 

In Section [2] we present a toric description of varieties X(T, G). The general method that 
we use can be applied to other problems in toric geometry. It is described in Section HI In 
Section [5] we present a basic combinatorial lemma. It is the main tool to prove Main Theorem 
121 what we do in Section [SJ The reader may notice the resemblance between contractions of 
tensors and methods of this section. The technical proof of Main Theorem [T] can be found 
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in Section [7J In the last Section [8] we present open problems in the language of algebraic 
geometry. 
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2. Constructions and notation 

Let T be a rooted tree and G a finite abelian group. We assume that all edges of T are 
directed away from the root, however we could consider any orientation. 

Definition 2.1 (Edges E, Vertices V, Leaves L, Nodes N). We define the sets E and V to 
be respectively the set of edges and vertieces of the tree T. Vertices of degree one are called 
leaves. The set of leaves is denoted by L. By abuse of notation, edges adjacent to leaves will 
also be called leaves. We define the set of nodes N := V \ L . 

There are two canonical groups one can consider. 

Definition 2.2 (Edge labellings Ge, Vertex labellings Gy, Nodes labellings Gn)- The group 
Ge of edge labellings consists of all functions f : E — >■ G. We define (/l/^Xe) = fi(e)f 2 (e). 
The group Ge is isomorphic to 

The group Gy of vertex labellings consists of all functions g : V — >■ G. We define 
(9id2)( v ) = 9i( v )d2{ v )- The group Gy is isomorphic to G^. 

The group Gn of vertex labellings consists of all functions g : N — >■ G. We define 
(flifl^X^) = gi(v)g2(v). The group Gn is isomorphic to G' N *. 

As the tree T is directed there is a canonical morphism s : Ge — > Gy. 

Definition 2.3 (The summing morphism s). We define s : Ge — > Gy by s(f)(v) = 
Yle=(xv) f( e ) ~ Ee=(«i) /( e )j where the first sum is taken over all edges incoming to v and 
the second over outgoing edges. 

In phylogenetics it is natural to distinguish the set of leaves and nodes. We define the 
natural projection 7Ttv- 

Definition 2.4 (Projection tin). We define the projection tin '■ Gy —> Gn by restricting the 
domain of a function from V to N. 

The most important combinatorial objects that we consider are the following. 

Definition 2.5 (Group-based flows). We define the group Q to be the kernel of the compo- 
sition ir n o s : Ge — > Gn- Hence the elements of Q are such edge labellings that "the signed 
sum around each node is trivial". This is a well-known condition for a flow, but instead 
of labelling edges with numbers, we label them with group elements. We call elements of Q 
group-based flows. We have got an exact sequence: 

1 -> g -> G E -> G N -»■ 1. 

As an abstract group we have an isomorphism Q = G' L ' _1 . 

Notice that in the definition above the role of sinks and sources is played by leaves. This 
construction can easily be generalized to other subsets of vertices. 
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Example 2.6. Let us consider the group G = Z 3 and the following tree: 

o 




Here 62, e%, and e§ are leaves. An example of a flow is an association e± — > 2, &2 — > 1, 
e 3 1, e 4 -)> 2, e 5 2. 

Remark 2.7. Eventually we will see, Definition 12.121 and 12.151 that group-based flows 
correspond to vertices of the polytope associated to the toric variety. Such objects were first 
introduced only for the group Z 2 and trivalent trees in [BW071 Definition 3.1] and were called 
networks. They were originally defined as pairwise disjoint paths, beginning and ending at 
leaves. Definition 12.51 is a direct generalization. One can associate to edges from paths an 
element 1 G Z2 and to all other edges £ Z2 obtaining a flow from a network. 




It is worth noting, that as each element of the group in this case has order two, the definition 
of the flow is independent from the orientation of the tree. 

It is well-known that, if the graph is a tree, it is enough to know the values at sources and 
sinks to reconstruct the flow. Of course, one cannot take arbitrary values for sources and 
sinks. For any flow, the sum over sources and sinks has to be equal to zero. 

Definition 2.8 (Sockets S). We define the group of sockets S whose elements are such 
functions f : L — >• G that YlieL /(0 = e o> where cq is the neutral element of the group. 

Notice that by restricting a flow to leaves one obtains a socket. In fact one can easily 
prove the following. 

Proposition 2.9. The groups Q and S are naturally isomorphic. □ 

Remark 2.10. As it was for flows also sockets were first defined in |BW07| Definition 
3.1] for trivalent trees and the group Z2. They were defined as even subsets of leaves. We 
see that the condition that the subsets are even corresponds to the summation condition in 
Definition E3 

We come to the definition of the polytope P that represents the variety A P (T, G). This 
will be a lattice polytope in the lattice M E . 

Definition 2.11 (Lattice Me, basis fr( e ,g))- We define Me as the lattice with basis elements 
indexed by all pairs (e,g) G E x G. The basis element indexed by {e,g) is denoted by 
b( e ,g) e M E . 

Definition 2.12 (Polytope P). To a flow f G Q one can naturally associate an element 
Pf := ^2 eeE fye,/(e)) £ Me- This is an element whose coordinates are equal to either 1 or 0. 
We define P to be the convex hull of all the points Pf over all group-based flows f . 
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The following proposition follows from the fact that P is a subpolytope of the unite cube 
in M E . 

Proposition 2.13. All integral points of P are vertices and are of the form Pf for certain 

flow f eg. □ 

Definition 2.14 (Semigroup S(P), Semigroup algebra C[P]). We define the monoid S(P) 
as the submonoid of Me spanned by vertices of P. We define C[P] as the semigroup algebra 
associated to the monoid S(P). Precisely C[P] is a complex vector space with basis elements 
identified with elements of S(P). The multiplicative structure on C[P] is induced from the 
additive structure on S(P). 

Definition 2.15 (Variety X(T,G)). Notice that C[P] is a graded algebra, taking vertices of 
P to be of degree 1. We define the projective variety X-p(T, G) as Proj C[P]. We also define 
X(T,G) = SpecC[P]. 

It is not obvious that the variety X(T, G) agrees with the one defined in the Introduction. 
Indeed, it is a nontrivial theorem - |SS05[ IMicllj . 

Remark 2.16. The variety Xp(T,G) is not the toric variety associated to the polytope 
P as defined for example in |Ful93j Section 1.5]. The variety X P (T,G) does not have to 
be normal - for example when G = Z 6 |DBM[ Computation 4.3]. In fact, the toric variety 
associated to the polytope P is the normalization of Xp(T, G). We will be mostly interested 
in properties of X(T, G). As the construction of X(T, G) is motivated by applications, there 
are not many results on the normalization, that we find an interesting object for further 
studies. 

Consider the vector space V with basis elements indexed by vertices of P. There is a 
natural embedding of Xp(T, G) in the projective space P(V). The choice of the basis induces 
a toric structure on P(V). The projective space F(V) contains a dense algebraic torus T. 
The intersection of T with Xp(T, G) is also an algebraic torus that acts, with a dense orbit, 
on Xp(T, G). We are interested in the ideal J(T, G) that defines Xp(T, G) in P(V). 

The following crucial fact is well known in toric geometry. 

Proposition 2.17 QStu96j Lemma 4.1). Let A be any finite set in a lattice M. Define the 
corresponding ideal I a as in [Stu96| Chapter 4] . The toric ideal I a is spanned, as a vector 
space, by the set of binomials 

where Y2 a iAi = Y biBi encodes an integral relation between points Ai,B>i £ A. □ 

We apply this proposition to A = P. We see that the description of the ideal I(T, G) is 
reduced to the study of integral relations between the vertices of the polytope P. 

2.1. Group based polytopes. So far we have seen that the group Q and polytope P are 
strongly related. We believe that the object consisting of such a pair deserves a separate, 
general definition. 

Definition 2.18 (Group-based polytope). Let Q be a lattice polytope in the lattice M. 
Suppose that a group G acts on the integral points of Q. We say that the pair (Q,G) is a 
group based polytope if the G action preserves the linear relations. That is for any relation 
Yli = Y Qj an d an V g £ G we have Y <?(%) — Y 9( a j)> where q i: qj £ Q are lattice points. 
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As before, the polytope Q generates a monoid S(Q) which defines the semigroup algebra 
C[Q] -cf. Definition 12.141 The toric variety SpecC[Q] has a natural embedding in the affine 
space V, with basis elements corresponding to lattice points of Q. As G acts on Q we see 
that V is a representation of G. The assertion in Definition 12 . 1 81 that the G action preserves 
the linear relation is equivalent to the fact that the action restricts to the variety SpecC[Q]. 

It is straightforward to see that (P, Q) is a group based polytope. Moreover as the action 
of Q on vertices of P is transitive and free we obtain the following proposition. 

Proposition 2.19. The natural ambient space V of the variety SpecC[P] is the regular 
representation of the group Q. The induced action on ¥{V) restricts to the variety Ap(T, G). 
□ 

Remark 2.20. In fact, canonically one should define flows as labellings of edges by charac- 
ters of G, that is elements of G*. As the groups G and G* are isomorphic, in all constructions 
we get isomorphic objects. We decided not to introduce G* to avoid unnecessary confusion. 

3. General methods in phylogenetics 

Although the construction from previous section takes several steps, still all the data that 
was used was the tree T and the group G. Thus one would expect to find certain relations 
between invariants of the ideal I(G, T), the group G and the tree T. 

Remark 3.1. An example of such a relation, in a slightly different setting, was presented 
in [BBKMIO] . The genus of the graph plus one was proved to be the upper bound on the 
degree of generation of an algebra associated to any graph and the group Z 2 . 

Definition 3.2 (Claw tree Kx jn ). The claw tree K\ >n is the tree with one node and n leaves. 

We introduce certain invariants that are of great interest [SS05[ Chapter 5]. 

Definition 3.3 ((j)(G, T), <j)(G), 4>{G, n)). Let 4>(G, T) be the (minimal) degree in which I(G, T) 
is generated. Let <f)(G,n) := 4>(G,Ki tn ). Let <f>(G) = sup<j)(G,T), where the supremum is 
taken over all trees. 

Let us restate the conjecture of Sturmfels and Sullivant. 

Conjecture 3.4 ( [5505] . Conjecture 29). We have <p(G) <\G\. 

It was separately stated in the most interesting case for G = 1,2 x Z2, |SS05[ Conjecture 
30]. Note that, a priori, we do not know if 4>(G) is finite, and indeed it is an open problem 
for groups different from Z2. 

Let us present the combinatorial way of representing group-based flows and relations 
between vertices of P. Using the isomorphism Ge — G^ E \ each element of Ge can be 
represented as a sequence of group elements of length \E\. We represent group-based flows, as 
such sequences, presented as column vectors. A multiset of group-based flows is represented 
by a matrix. Each column corresponds to a flow and the rows are indexed by edges of the 
tree T. Recall that flows correspond to vertices of P. Consider any combination a^j 
where Vi 6 P are vertices of P and ctj G N. Such a combination can be encoded as a matrix, 
where each flow corresponding to V\ appears ai times. Of course such matrices should be 
considered up to permutation of columns. 

Continuing this, we may encode a relation Y^ a i v i = Yl a j v j as a P & i r °f matrices Ai,A 2 
of the same size such that: 
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(1) rows are indexed by edges, 

(2) each column represents a network, 

(3) for each edge e the e-th row in A\ is equal to the e-th row of A 2 up to permutation. 
Example 3.5. Consider the following tree and numbering of edges: 

(3.1) 




Take G = Z 2 . An example of a nontrivial relation between flows is as follows: 
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One of the main methods in algebraic phylogenetics is reduction to simpler trees. Each tree 
T, different from the claw tree can be subdivieded into two simplier trees T\ and T 2 , by 
dividing an inner edge. 




The operation can be reversed. If we have got two trees with distinguished leaves we can 
join them, obtaining one tree. 

Often one can transfer certain results from the trees T\ and T 2 to the tree T. This is the 
case for the degree of generation. We encourage the reader to proof the following result using 
the language of flows. 

Proposition 3.6. Suppose that we have got two trees Ti and T 2 with distinguished leaves 
and that the tree T is obtained by joining them. Then 4>(G,T) < max(0(G, Ti), 4>(G, T 2 )).D 

In fact one can derive the generators of the ideal I(G,T) from the ideal J(G,Ti) and 
I(G,T 2 ), [SS051 Theorem 26], [Sul07l Corollary 2.11]. Thus if we want to bound I(G, T) for 
any tree T it is enough to consider claw trees. 

Remark 3.7. The "reduction to claw trees" has been obtained in a much more general 
setting then presented in this paper |AR08t IDK09] . Still the problem of describing the ideal 
for claw trees is extremely hard |DK09t p. 17], apart from the simple case G = Z 2 |CP07j . 

In algebraic geometry varieties can be compared on different levels. The most common 
distinction is to compare them as schemes or as sets. Consider two ideals Ji,/ 2 in a ring 
of polynomials C[xi, . . . ,x n }. It is well-known that they define the same affine scheme if 
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and only if they are equal. By Hilbert's Nullstellensatz we know that they define the same 
set if and only if their radicals are equal. We call the first equality ideal theoretic and the 
second one set theoretic. If the ideals are homogenous they define not only affine schemes, 
but also projective schemes. Let m = (xi, . . . ,x n ) be the so called irrelevant ideal. The 
homogenous ideals I\ and I2 define the same projective scheme if and only if their saturation 
with respect to the maximal ideal are equal, that is I\ : m°° = J 2 : m°°. Recall that 
I : J°° = {/ : fj C / for some N large enough} is called saturation of / with respect to J. 
We call equality of ideals after saturation scheme theoretic. Of course equality of projective 
schemes implies equality of projective sets. So if both I\ and I2 are simultaneously either 
contained or not in m then scheme theoretic equality implies set theoretic. 

Example 3.8. Consider C[x,y] and three ideals I\ = (x,y), I2 = (x 2 ,xy,y 2 ), I3 = (1). All 
of them define the same (empty) projective scheme. Their saturation is I3. The first two 
define a nonempty affine set, supported at 0, with different scheme structure. 

4. Idea 

Definition 4.1 (Subideal I^)- For any graded ideal I we define the subideal generated by all 
elements of degree at most d by 1$. 

Let us start from general remarks on toric geometry. Consider a lattice polytope Q 
contained in a hyperplane H not passing through 0. This allows us to put a grading to the 
associated monoid generated by Q and hence on the associated toric ideal Iq. Suppose that 
we want to prove that Iq and Iq<i for some d G N define the same projective scheme. Let X 
be the projective variety defined by Q. The integral points of Q correspond to coordinates of 
the ambient projective space of X. Proving that the saturation of iQd equals Iq is equivalent 
to proving that IQd and Iq are equal in each localization with respect to any coordinate, 
represented by a lattice point R G Q. Thus we have to prove that any generator of Iq 
multiplied by a sufficiently high power of the variable corresponding to R belongs to Iq^. 

Let us translate this condition to combinatorial language. The generators of Iq correspond 
to relations between points of Q. Let us fix a relation Y A$ = Y^ji where Ai,Bj G Q. 
Multiplying the corresponding element of the ideal by the variable corresponding to R is 
equivalent to adding R to both sides of the relation. Thus we have to prove that the 
binomial corresponding to the relation Y A4 + mR = Y B j + mR is generated by binomials 
from Iq of degree at most d for m sufficiently large. 

A binomial corresponding to a relation R% = Y^i between points of a polytope is 
generated in a degree d if and only if one can transform Y R-i to Y &i using a sequence of 
simple steps. In each single transformation one can replace points R±, . . . , Rk for k < d by 
R[, . . . , R' k if they satisfy the relation Yi=i = Yi=i R'i- m such a case we say that the 
relation is generated in degree d. 

The proof scheme is very simple: 

(1) Using degree d relations reduce A iy Bi to some simple, special points of Q contained in 

a subset L C Q. (*) 

(2) Show that any relation between the points of L is generated in degree d. 

In general any of this two points can be very difficult. 

Remark 4.2. It is well known that the projective toric variety defined by a polytope Q 
is covered by affine subsets given by localizations by variables corresponding to vertices. 
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Thus one can be tempted to prove that Iq = Iq& only in the localizations by variables 
corresponding to vertices. Note however that in general, we do not know if the scheme 
defined by Iq^ is also covered by open sets given by localizations by variables corresponding 
to vertices. Indeed, Iq^ and Iq may be different on the set-theoretical level. For example if 
Proj Iqcl contains a point that is zero on the coordinates corresponding to vertices and nonzero 
on some other coordinates, then such a point will not belong to any open set corresponding 
to a localization with respect to vertices. However, if Rad/Q^ = Iq, then of course it is 
enough to consider localizations with respect to vertices. See also |Brullj . 

As our polytopes P have only vertices, the problem described in Remark 14.21 does not 
concern us. We have the following equivalences for a toric ideal Iq given by a polytope Q. 

• All relations between vertices of Q are generated in degree d the ideal / is generated 
in degree d. 

• For any point R G Q and any relation there is an integer m such that after adding 
mR to both sides of the relation, it is generated in degree d <^ the projective scheme 
defined by Iq can be defined by an ideal generated in degree d. 

Lemma 4.3. Suppose that P is a group-based polytope with G acting transitively on its 
integral points. The projective scheme Proj C[P] can be represented by an ideal generated 
in degree at most d if and only if there exists a point R G P such that for any relation 
'YjAi = between points of P for m sufficiently large ^2 A, + mR = + mR is 

generated in degree at most d. 

Proof. As G acts transitively the assumption that there exists a point R with the given 
property is equivalent to the fact that the property holds for all integral points of P. □ 

5. Combinatorial lemma 

Definition 5.1 ([n]). For any n G N let [n] be the set of natural numbers from zero to n. 

Definition 5.2 (Coloring). A coloring of length n is a function f : [n] — > [i]. The number i is 
called the number of colors. The support of the coloring f is defined as {k G [n] : f(k) ^ 0}. 

Definition 5.3 (Transformation). Consider two colorings /i,/2 : [n] —> [i]. Suppose that 
there exist two numbers < ki,k2 < n such that kj is not in the support of f r Moreover 
suppose f\(k 2 ) = f2{ki). Define f'Ax) = fj(x) for x ^ ki,k 2 . Moreover f[{ki) := /^(^i) and 
f^iki) := fi(ki). We call f[,f 2 a transformation of fi and f 2 . 

Transformation corresponds to exchanging the fixed color in two colorings with a "blank" 
color. A set of colorings can be transformed into another by choosing two colorings and 
transforming them. We generate an equivalence relation on colorings by transformations. 
Abusing the notation the relation is also called transformation. 

Lemma 5.4. Let us fix two natural numbers i (number of colors) and s (bound on the 
support). Fix e > 0. There exists N G N, such that for all n > N any collection of colorings 
fi, . . . , f m : [n] — > [i] with support of cardinality at most s can be transformed into a collection 
f[, ■ ■ ■ , fm such that there exist (1 — e)[|] even numbers x < n, such that for any f[ at least 
one of the numbers x, x + 1 is not in its support. 

Note that the statement is nontrivial, as the number m of colorings is not bounded. 
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Proof. The proof is inductive on s. For s = 1 we can take all even numbers smaller then n, 
thus the statement is trivial. 

Assume that the statement is true for s. We will prove it for s + 1, proceeding inductively 
on g. Fix e. 

Suppose g = 1. By induction hypothesis consider N' for s, g = 1 and e' = e/2. Consider 
N = k(N' + 1) for large k. We call a coloring bad if its support is entirely contained in 
some interval < m(N' + 1), (m + 1)(N' + 1)). If we have two bad colorings with supports 
in different intervals we can transform them to two colorings that are not bad. Thus we can 
assume that all bad colorings are contained in one interval. By the induction hypothesis, we 
can transform the colorings, so that in each other interval, we can choose (1 — e/2) [^-] even 
numbers. We see that for k large enough the theorem holds. 

The case for greater g is only slightly more complicated. By induction, we choose N' that 
holds for e/(g + 2), both in case when g or s is smaller. Consider N = k(N' + 1). We call a 
coloring bad if both: 

• its support is contained in some interval < m(N' + 1), (m + 1)(N' + 1)), 

• all g colors appear in its image. 

As before we can assume that all bad colorings are contained in one interval, say the first 
one. For any other interval, all colorings that are not bad can be divided into (not necessarily 
disjoint) subsets: 

• g subsets depending on the color that is missing in the image, 

• one subset containing colorings with supports not contained in the given interval. 

For each coloring we choose one subset in which it is contained and fix this choice. By 
induction, for each interval apart from the first one, for each subset of colorings, there are 
at most (e/(g + 2))[y] even numbers that do not satisfy the theorem. Hence, the theorem 
holds for k large enough. □ 

6. Bounding the degree of phylogenetic invariants 

Definition 6.1 (Support of a flow). Let n be any flow. The set of edges to which n associates 
a nonneutral element is called the support of n. 

Lemma 6.2. Consider a claw tree K\^ n and a finite abelian group G. Let nt be the neutral 
element in the group of flows. For any relation Ai = ^2 E>i of integral points of the 
associated polytope P there exists m such that the relation mnt + s ^ l A i = mnt + A can 
be transformed, using only quadrics, to a relation among flows with support at most \G\ — 1. 

Proof. Consider any flow A. If its support is of cardinality greater that |G| — 1 we can find a 
strict subset S of edges such that ^2 e€S A{e) is the neutral element of G, where the addition 
is taken in G. Thus nt + A equals the sum of two flows with strictly smaller support. The 
lemma follows easily. □ 

Lemma 6.3. For any finite, abelian group G there exists N, such that if the ideal I(K\ t N, G) 
is generated in degree d, then the scheme X(K l m , G) can be defined by an ideal generated in 
degree d for any m. 

Proof. Let us choose iV from Lemma [5.41 for % — s — \G\ — 1 and e < 1/2. First let us prove 
the following claim: 

Any relation between flows of length m with support of cardinality at most |G| — 1 is 
generated in degree d. 
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Let us prove it inductively on m, starting from m = N, where the assumption holds. Note 
that each flow can be considered as a coloring; we number group elements from to |G| — 1, 
assigning zero to the neutral element. As e < 1/2 we can find an even number x, such that 
x or x + 1 is not in the support for any flow appearing in the relation. Let us replace each 
flow / appearing in the relation by: 



We obtain a relation between flows /' of length m — 1. Thus, by induction, it is generated 
in degree d. Note that each generation step can be lifted to the relation between /, what 
proves the claim. 

By multiplying any relation by the flow nt in a sufficiently high power one can reduce any 
relation to relation between flows with support of cardinality at most \G\ — 1 using quadrics, 
Lemma 16.21 As flows act transitively on P, by Lemma 14.31 it is enough to consider relations 
multiplied by a high power of nt. Thus the lemma follows. □ 

Remark 6.4. The constant N can be explicitly computed. Moreover due to standard the- 
orems, a toric variety in bounded dimension and bounded exponents of defining monomials 
has got an explicitly bounded degree of generation - |Stu96l Theorem 4.7, Corollary 4.15]. 
Thus Main Theorem^ follows. 



The whole Section is devoted to the proof of Main Theorem [TJ 

Theorem 7.1. For any tree T the ideal I(T, Z 2 x Z 2 ) and the subideal generated in degree 
at most four define the same projective scheme. 

Following the arguments of [SS05[ Chapter 5] one immediately reduces to the case when 
T = Kij for certain I G N. Let us restate the general definitions already introduced, in case 
of the group G = Z 2 x Z 2 (the 3-Kimura model) on a claw tree Kij. 

Definition 7.2 (Flow, ideals / and I4, polytope P). A flow is an association of elements of 
the group Z 2 x Z 2 to edges of K\x, such that the sum of all elements is the natural element. 
Let I be the ideal of the variety X(Kij, Z 2 x Z 2 ) and let I4 be the subideal generated in degree 
4. Let P be the corresponding polytope. 

We will identify a flow with an /-tuple of group elements summing up to zero. The sum 
of such /-tuples will be a coordinatewise sum, where each entry is treated as an element of 
the free abelian group generated by elements of Z 2 x Z 2 . Each flow represents a vertex of a 
polytope P. The addition described above is the addition in the lattice generated by vertices 
of the polytope. 

Example 7.3. For / = 4 we can add: 




7. Proof of the main theorem [TJ 



(0, (1, 0) + (1, 1), 2(0, 1), -3(0, 0)) + ((0, 1), (1, 0) - (1, 1), (1, 0), (0, 0)) 



= ((0,1), 2(1,0), 2(0,1) + (1,0), -2(0,0)). 
The trivial flow is ((0, 0), (0, 0), (0, 0), (0, 0)). 
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Definition 7.4 (Pair, triple). We say that a flow is a pair if and only if the cardinality of 
the support is equal to two. We say that a flow is a triple if and only if the cardinality of 
the support is equal to three. 

By Lemmas 14.31 and 16.21 we have to generate relations only between flows that are pairs 
and triples. This completes the first step of the method (*) presented in Section HJ The set 
L consists of pairs and triples. Note that this part of the proof can be adjusted to other 
groups G, for L consisting of flows with support of cardinality at most \G\ — 1. 

Let us fix any relation Yl n i = ^2 n 'ii where n; and n\ are either pairs or triples. Our aim 
is to transform rij to V n\ in a series of steps, each time replacing at most four by 
flows with the same sumj. We assume that among there are more or the same number of 
triples as among n\. We first try to reduce the relation, so that consequently: 

(1) Among rij there £1X6 cLS few triples as possible, 

(2) Among n\ there are as few triples as possible, 

(3) The degree of the relation is as small as possible. 

More precisely let t and t' be the number of triples among respectively and n^. Let d be 
the degree of the relation. Our proof will be inductive on (t, t', d) with lexicographic order. 

To prove Theorem 17.11 we consider separately three cases depending on the number of 
triples among rij. The cases are: 

a) there are no triples, 

b) there is exactly one triple, 

c) there are at least two triples. 

We say that a family of flows agrees on an index j of an edge if they all associate the 
same element to j and j belongs to their support. We will denote by g±, $2 and g% the three 
nontrivial elements of Z2 x Z2. A triple that associates g\ to index a, $2 to index b and g% 
to index c is denoted by (a, b, c). A pair that associates an element gi to indices d and e will 
be denoted by (d, e) 9i . We say that ^ is contained in a flow if there exists an index j, such 
that the flow associates gi to j. We believe that the following proofs are impossible to follow 
without a piece of paper. We strongly encourage the reader to note what flows appear on 
both sides of the relation at each step of the proof. 

7.1. The case with no triples. First note that there are no triples among n[. Without 
loss of generality we may assume that n\ is a pair equal to (a, b) gi . Hence there exists n[, say 
n[, that is (b, c) gi for some index c. If c = a we can reduce this pair, hence we assume c 7^ a. 
There exists a flow, say n2 that is (c, d) 9l . If d = b we can reduce this pair. We consider two 
other cases: 

1) d 7^ a. Then we use the degree two relation (a, b) gi + (c, d) gi = (a, d) gi + (6, c) gi and we 
can reduce (6, c) gi . 

2) d = a. Then there is a flow, say n' 2 given by (a, e) gi . If e = b or e = c we can reduce 
this pair. In the other cases we use the relation (a, e) gi + (b, c) gi = (a, b) gi + (e, c) gi and we 
reduce (a, b) gi . 

Notice that in this very easy case we have only used degree two relations. 

7.2. The case with one triple. Let n\ be the only triple among rij. 
Lemma 7.5. There is exactly one triple among n^. 



2 Wc are also allowed to add the flow nt to both sides. 
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Proof. Due to the assumptions we know that there is at most one triple among n-. We 
exclude the case when there are no triples by comparing the parity of the number of times 
the element g\ appears on both sides of the relation. □ 

Due to the previous lemma we may assume that n! x is the only triple among n\. Let 
ni = (1,2,3). ' 

7.2.1. Case: the triples agree on at least two elements in their support. Consider the case 
where triples agree on two elements, say 1 and 2, that are in their support. Suppose that 
ri\ = (1, 2, 3) and n[ = (1, 2, c). Of course if c = 3 we can make a reduction. In other case 
we must have a pair (c, d) 93 among m. If d ^ 3 then we use the relation (c, d) 93 + (1, 2, 3) = 
(1, 2, c) + (3, d) 93 and reduce the triples. Hence d = 3. Analogously there must be a pair 
(3, c) g3 among n' i: hence we can reduce this pair. 

7.2.2. Case: the triples agree on exactly one element in their support. Consider the case 
when triples agree on at least one element, say 1, in their common support. By the previous 
case we may assume that they agree on exactly one element. 

As before let n\ = (1, 2, 3) and n[ = (1, b, c). We consider three cases. 

1) 6^3. 

There must be a pair (6, d) 92 among n^. If d ^ 2 then we can apply the relation (b, d) 92 + 
ri\ = (1, b, 3) + (d, 2) 92 . This reduces to the case 17.2.11 So we assume d = 2. There must be 
a pair (2, e) 92 among n^. Hence there must also be a pair (e, f) 92 among rij. If / ^ b we can 
use a relation (e, f) 92 + (2,b) 92 = (e, 2) 92 + (f,b) 92 and reduce (e, 2) S2 . For / = b we must 
have a pair (b,g) 92 among n\. If g = 2 or g = e then this pair can be reduced. In the other 
case we use the relation (e, 2) 92 + (6, g) g2 = (e, g) 92 + (b, 2) 92 and reduce (6, 2) 92 . 

2) c ^ 2. 

This case is analogous to 1). 

3) b = 3 and c = 2. 

Lemma 7.6. // t/iere is a pair (p,q) g2 among n i; such that p, q ^ 2 then we may assume 
that it is equal to (1,3). 

Proof. Suppose that p ^ 1, 3 and q ^ 2. We apply a relation (p, g) 52 + m = (l,p, 3) + (g, 2) 92 
and reduce to case 1) b ^ 2. □ 

Analogously if there is a pair (p,q) g2 among r^, such that p, q ^ 3 then this pair equals 
(1,2)*. 

Notice that there must be a pair (3, d) 92 among and a pair (2, e) 92 among n\. From 
Lemma [7.61 d equals 2 or 1 and e equals 3 or 1. We will consider subcases. 

3.1) Suppose that d = 2. 

If e = 3 then we can make a reduction of pairs. If e = 1 we must have a pair (1, f) 92 
among rij. If / = 2 we make a reduction, hence we assume / = 3. This means that there 
must be a pair (3,p) 92 among n[. If g = 2 or g = 1 we can make a reduction. Otherwise we 
apply the relation (l,2) ff2 + (3, g) 92 = (1,3) 92 + (2,g) 92 and reduce the pair (1,3) 92 . 

3.2) Suppose that e = 3. 
This case is similar to 3.1). 

3.3) Suppose that d — 1 and e = 1. 

As this is the only case left we may repeat the same reasoning for g^. In particular, we 
must have a pair (1, 2) 93 among rij. We see that we can reduce the triples by applying the 
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following relation: 

(1, 2, 3) + (1, 3) 52 + (1, 2) 93 = (1, 3, 2) + (1, 2) g2 + (1, 3) fl3 . 
This is a degree three relation. 

7.2.3. Case: The triples do not agree on any element of the support. We want to reduce to 
one of previous cases. Consider the following two cases. 

1) The triples n\ and n[ have different supports. 

Once again let (1,2,3) = n\ and let (a,b,c) = n[. We may assume that a is not in the 
support of n\. We see that there must be a pair (a, f) gi among rij. If / ^ 1 we can use a 
relation (a, f) gi + n\ = (a, 2, 3) + (/, This reduces to the case \7.2.2\ hence we assume 
that / = 1. There must be a pair (g, l) gi among n' { . If g = a we can reduce this pair, so 
we assume g ^ a. Notice that there must be a pair (g,h) gi among rij. If h ^ a, then we 
can use relation (1, a) gi + (g, h) gi = (g, l) gi + (h, a) gi and reduce the pair (g, l) gi . So we can 
assume h = a. Then there must be a pair (a, i) gi among n\. If i — 1 then we can reduce it. 
Otherwise we can use the relation (g, l) gi + (a,i) gi = (g,a) gi + (l,i) gi and reduce the pair 

2) The set {1,2,3} is the support of n[ andn\. 

Remember that due to the assumption 17.2.31 the triples n\ and n[ do not agree on any 
element from their support. Without loss of generality we may assume ri\ = (1,2,3) and 
n[ = (2, 3, 1). Hence there must be a pair (2, a) gi among n, and (1, b) gi among n\. If a = 1 
and 6 = 2 then both pairs are the same and can be reduced. As both cases are symmetric 
we can assume that a / 1. 

If a 7^ 3 we can use the relation (2, a) gi + (1, 2, 3) = (a, 2, 3) + (2, l) gi . This reduces to the 
case with different supports. We are left with the case a = 3. There must be a pair (3, z) gi 
among n^. If z ^ 1 we can use the relation (3, z) gi + n[ = (z, 3, 1) + (2,3) 9l . This would 
enable to reduce the (2, 3) 9l pair and decrease the degree. So we can assume that z = 1. So 
far we have shown that there must be pairs (2, 3) Sl among rij and (3, l) gi among n|E By the 
same reasoning for gi and g% we see that we can use the following relation: 

(1, 2, 3) + (2, 3) S1 + (1, 3) 92 + (1, 2) g3 = (2, 3, 1) + (2, 3) fl3 + (1, 3) gi + (1, 2) g2 . 

Notice that this is a degree four relation. It enables us to reduce triples. 

7.3. The case with at least two triples. We suppose that there are at least two triples 
among rij. 

Lemma 7.7. If there are two triples n\, n 2 among rii that do not agree on any element 
of their supports then we can make a reduction. Thus we can assume that any two triples 
among ni agree on at least one index. 

Proof. The assumptions are equivalent to n\ = (a,b,c), n 2 = (d,e,f) with a ^ d, b ^ e, 
c 7^ /• We apply the relation n\ + n 2 + nt = (a, d) gi + (b, e) g2 + (c, f) g3 that reduces the 
number of triples. □ 

Lemma 7.8. If there is no index on which all triples from agree then we can make a 
reduction. 



Notice that we have made a symmetry assumption u^l. The symmetric assumption would be b ^ 2. 
However as the result we got was symmetric, also for b ^ 2 we prove the existence of the same pairs. 
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Proof. Suppose there is no index on which all n t agree. We may consider only two cases due 
to Lemma [7.71 

1) Suppose that any two triples from n« agree on at least two elements. 

Consider any triple rii = (1, 2, 3). Due to the fact that not all triples from associate g± 
to 1 there is a triple (a, 2, 3) with a ^ 1 among rij. There also must be a triple that does not 
associate g 2 to 2. It cannot agree both with (1, 2, 3) and (a, 2, 3) on two indices. 

2) There exist two triples that agree only on one index. 

Let rii = (1,2,3) and n 2 = (1,6, c) with 6^2 and c / 3. Due to the case assumption 
there is a triple 713 = (d, e, /) with d 7^ 1. Remember that any two triples have to agree 
on at least one element due to Lemma 17.71 Hence without loss of generality we can assume 
e = 6 and / = 3. We can apply the relation: 

ni + n 2 + n 3 + nt = (d, l) gi + (2, 6) 92 + (3, c) gz + (1, 6, 3), 

that reduces the number of triples. □ 

Due to the previous lemma we may assume that there exists an index, say 1, such that all 
triples among rii associate to it the same nonneutral element, say g\. 

Definition 7.9 (k). Let k be the number of indices on which all triples among rii agree. We 
know that 1 < k < 3. 

We proceed inductively on k, as for k = we already know how to reduce the relation. 
Hence from now on decreasing k is also a reduction. 

Lemma 7.10. Suppose that all triples from associate gj to an index I. If there is a pair 
(x, y) 9j among Hi with I 7^ x,y then either {I, x, y} is the support of all triples among rii or 
we can make a reduction. 

Proof. To simplify the language assume gj = g\ and 1 = 1. Suppose that there is a triple 
rii = (1,6, c) with the support different from {l,x,y}. We can assume x 7^ 6, c. We apply 
the relation ni + (x, y) gi = (x, 6, c) + (1, y) gi what reduces k. □ 

Lemma 7.11. Suppose that all triples from n« associate gj to an index I. If all pairs (x, y) 9j 
among Hi have I in the support then we can reduce all such pairs. 

Proof. Let t be the number of triples among n^. Let p be the number of gj pairs among rii. 
Let t[ and t' 2 be the number of triples in n' { that respectively assign or do not assign gj to 
/. Let p[ and p' 2 be the number of gj pairs among n- that respectively have or do not have / 
in the support. We know that t > t[ + 1' 2 . Comparing the number of times gj appears in rii 
and n't we get: 

t + 2p = t , 1 +t' 2 + 2{p , l +p , 2 ). 
Comparing the number of times gj appears on index I we get: 

t + p = t' 1 +p' 1 . 

This forces t 2 = p' 2 = 0, t = t\ and p = p[. Hence all gj pairs and triples among rii and n[ 
must assign gj to /. Hence the multisets of pairs must be the same for and n\. □ 

Lemma 7.12. If there are g\ pairs among n i; then we can make a reduction. 
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Proof. We will prove that there are no pairs (a, b) gi among rij that do not have 1 in the 
support. Due to Lemma [7. Ill this will finish the proof. Suppose that there is a pair (a,b) gi 
among with a,b ^ 1. Due to Lemma [7. 101 all the triples among rii must have the support 
{1, a, b}. So either k = 1 or k = 3. If k — 1 we can apply the relation 

(1, a, b) + (1, b, a) + (a, b) gi + nt = (1, a) gi + (1, b) gi + (a, b) g2 + (a, b) g3 . 

This reduces the number of triples. Thus we can assume that all triples among n; are equal 
to (1, a, b). 

Claim: Consider any pair (c,d) g2 among n^. We can assume that its support is contained 
in {1, a, b}. 

Proof of the Claim. Suppose this is not the case, that is c ^ {l,a,b}. Due to Lemma [7.101 
we can assume d = a. 

1) Suppose that there is a g 2 pair among Hi that does not contain a in the support. 

It must be equal to (1, b) g2 due to Lemma !?. 101 We can apply the relation (1, b) g2 + (a, c) g2 = 
(c, l) g2 + (a,b) g2 . Applying once again Lemma 17.101 to the pair (c, l) g2 we can make a 
reduction. 

2) All (/2 pairs among ni contain a in the support. 

Due to Lemma 17.111 we can make a reduction. □ 

Thus the support of all g 2 pairs among is contained in {1, a, b}. The same holds for gi 
and #3 pairs. Thus all flows among rii have support contained in {l,a,b}. Hence the same 
must hold for n! si . So our relation is a relation only on three indices. It is well known that 
the ideal for a tree with three edges is generated in degree 4, so in particular the considered 
relation is generated in degree 4. □ 

Corollary 7.13. // all triples among ni associate gj to an index I, then there are no g,j pairs 
among rij. Consequently there are no gj pairs among n\ and all triples among n[ associate 
gj to I. Moreover the number of triples among equals the number of triples among n\. □ 

We reduced to the case were there are no g\ pairs neither among ni nor n^. Moreover, 
there is the same number of triples among ni and n\ and they all associate g\ to 1. 

Lemma 7.14. // all the triples among ni and n\ have support in {1, 2, 3} then we can make 
a reduction. 

Proof. In this case k = 1 or k = 3. If k = 1 then among there is a triple (1,2,3) and 
(1,3,2). One of this triples can be reduced. If k — 3 there are no pairs. All triples among 
Hi and n! i are equal, thus the relation is trivial. □ 

7.3.1. Case: k = 1. We first consider the most difficult case k = 1. As always let ni = 
(1, 2, 3) and n[ = (1, b, c). As the proof is quite complicated we decided to include the tree 
that describes most important cases. While reading the proof we encourage the reader to 
follow at which node we are. The proof is "depth-first, left-first". 
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We start with the left node in the second row - assume 6 = 2. 

We move to the most left node in the third row - suppose that there is no g% pair among 
rii that has got c in the support and, symmetrically, there is no g% pair among that has 
got 3 in the support. There must be a triple (1, e, c) among rij. If e 7^ 3 then we apply the 
relation (1, 2, 3) + (1, e, c) = (1, 2, c) + (1, e, 3) and reduce the triple (1, 2, c). We have e = 3. 
Analogously among there must be a triple (l,c, 3). Hence there must be either a pair 
(c, /) 92 or a triple (1, c, g) among rij. 

We continue to the most left node in the fourth row - suppose that there is a pair (c, f) 92 . 
If / 7^ 2 we apply the relation (1,2,3) + (c, /) 92 = (l,c, 3) + (/, 2) 92 and reduce the triple 
(1, c, 3). If / = 2 we apply the relation (1, 3, c) + (c, 2) 92 = (1, 2, c) + (3, c) 92 and reduce the 
triple (1, 2, c). 

Hence we can assume that there is a triple (1, c, (?) among n; - second node in the fourth 
row. If (7 7^ 2 then we apply the relation (1, c, g) + (1, 2, 3) = (1, 2, g) + (1, c, 3) and reduce 
the triple (1, c, 3). For g = 2 we apply the relation (1, 2, 3) + (1, 3, c) + (1, c, 2) = (1, 2, c) + 
(1, 3, 2) + (1, c, 3) and reduce the triple (1, 2, c). 

We continue to the second node in the third row. We assume that there is a pair (3, /) 93 
among n^. If I ^ c we apply the relation (1, 2, c) + (3, /) 93 = (1, 2, 3) + (c, Z) 93 and reduce the 
triple (1, 2, 3). If there was a pair (c, "m) 93 among then analogously we would have m = 3 
and we would be able to reduce this pair. So there must be a triple (l,n, c) among n,. If 
n^3 then we apply the relation (1, 2, 3) + (1, n, c) = (1, n, 3) + (1, 2, c) and reduce the triple 
(1, 2, c). So we assume ri2 = (1, 3, c). Hence there is either a pair (3, o) 92 or a triple (1, 3,p) 
among n^. 

We move to the third node in the fourth row - suppose that there is a triple (1, 3, p) among 
n'i. If p 7^ 2 we apply the relation (1, 2, c) + (l, 3,p) = (1, 2,p) + (l, 3, c) and we reduce (1, 3, c). 
So we have p = 2. We apply the relation (1, 3, 2) + (3, c) 93 = (1, 3, c) + (2, 3) 93 and reduce 
the triple (1, 3, c). 

We pass to the fourth node in the fourth row - we assume that there is a pair (3, o) 92 and 
there is no triple (1, 3,p) among n^. If o ^ 2 then we apply the relation (1, 2, c) + (3, o) 92 = 
(1, 3, c) + (2, o) 92 and reduce (1, 3, c). So there is a pair (2, 3) 92 among Suppose that this 
pair appears r > times among n^. Note that there are no pairs (2, s) 92 among rij. Indeed 
suppose that there is such a pair. If s ^ 3 then we apply the relation (1,3, c) + (2, s) 92 = 
(1,2, c) + (3, s) 92 and reduce the triple (l,2,c). If s = 3 we reduce the pair (2,3) 92 . Hence 
there must be at least r + 1 triples of the type (1,2, t) among n^. If there is a triple 
with t ^ 3 then we apply the relation (l,3,c) + (1,2, t) = (1,3, t) + (l,2,c) and reduce 
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the triple (1,2, c). Hence we have got at least r + 1 triples (1,2,3) among rij. Notice 
that there are no triples of the type (l,y,3) among n[. Indeed, in such a case we could 
apply the relation (l,y, 3) + (2, 3) S2 = (1,2,3) + (y,3) 92 and reduce (1,2,3). Hence there 
must be at least r + 1 pairs of the type (3, u) g3 among n^. If u ^ c then we apply the 
relation (1, 2, c) + (3, u) 93 = (1, 2, 3) + (c, u) 93 and reduce the triple (1, 2, 3). Hence we have 
at least r + 1 pairs (3, c) g3 among n\. Note that there are no pairs of the type (c, v) g3 
among rij. Indeed if v = 3 we could reduce this pair. If v ^ 3 then we apply the relation 
(1,2,3) + (c, v) ga = (l,2,c) + (3, v) ga and reduce the triple (1,2, c). Hence we must have 
at least r + 1 triples of the type (l,z,c) among rij. If z ^ 3 then we apply the relation 
(1,2,3) + (1,2, c) = (l,2,c) + (1,2:,3) and reduce the triple (l,2,c). So there are at least 
r + 1 triples (1, 3, c) among rij. Note that the elements #2 on 3 cannot be reduced - among v! i 
there are only r pairs containing them and no triples. The contradiction finishes this case. 

Consider the third node in the third row - there is a pair (c,w) g3 among n^. This is 
completely analogous to the second node in this row, already considered. 

Also the second node in the second row - c = 3 - is analogous to the first node in the 
second row. 

We are left with the last, third node in the second column - any two triples rij and n'- 
agree on exactly one index, that is on 1. Due to Lemma 17. 141 we can assume b ^ 2 and 6^3. 
Due to the case assumption there must be a pair (b, d) 92 among rij. If d ^ 2 then we apply 
the relation (1, 2, 3) + (b, d) 92 = (1, b, 3) + (d, 2) 92 and reduce to the case b = 20. Analogously 
we must have the same pair among n! i and it can be reduced. 

7.3.2. Case: k = 2 or k = 3. Suppose now that k = 2. Let n\ = (1,2,3) and n[ = (1,2, c). 
If we cannot reduce then there must be a pair (c, d) g3 among rii and a pair (3, e) 93 among 
n't- If d = 3 and e = c we can reduce the pairs. Thus we can assume that d ^ 3. We apply 
the relation (1, 2, 3) + (c, d) 93 = (1, 2, c) + (3, d) 93 and reduce the triple (1, 2, c). 

The last, easiest case is k = 3. Then all triples are equal to (1,2,3) and there are no 
pairs due to Corollary 17.131 Hence we can reduce the triples. This finishes the proof of the 
theorem. 



We have already recalled two conjectures of Sturmfels and Sullivant 11.11 and 11.21 One can 
state simpler conjectures, that we find important. 

Conjecture 8.1. For any group G and any tree T the ideal I(X(G,T)) and the subideal 
generated in degree \G\ define the same projective scheme. 

We believe that the methods of Section [7] could be adopted, especially for groups of small 
cardinality. 

Consider a claw tree K^ n . Subdivide the set of leaves into two separate subsets. By adding 
an inner edge that separates this set one obtains a tree T as shown below. 



Notice that we do not reduce to the case k = 2 as if this was true we would have already been in the 
first node in the second column b = 2. 



8. Open problems 
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One can easily see that for any@ group G we have X{K\^G) C X(T,G). As long as the 
subsets of leaves are of cardinality at least 2 the procedure decreases the maximal degree 
and allows for an inductive generation of phylogenetic invariants. 

Conjecture 8.2 ( |DBM] Conjecture 3.6). The variety X(K ln ,G) is a scheme theoretic 
intersection of all X(T, G), where T is a prolongation with maximal degree at most n — 1. 

This conjecture would imply the following. 

Conjecture 8.3. The function 4>(G, •) is constant. 

In view of the previous conjecture and Conjecture II . 21 it is natural to bound <p(G, Ki 3 ). It 
is an open problem if 4>(G, Kig) < \G\. On the other hand, by general [Stu96t Theorem 4.7] 
we have an exponential bound on 0(G,i^ lj3 ). The following question, as far as we know, is 
open and would be an interesting step towards conjectures of Sturmfels and Sullivant. 

Conjecture 8.4. There exists a polynomial P such that <j)(G,K lt3 ) < P{G). 

One obtains a much more difficult open problem by replacing 3 in the previous conjecture 
by any (fixed or even non fixed) number. 

The following conjecture concerns normality of varieties X(T,G). 

Conjecture 8.5. The variety X(T, 7*2 x ^2) is projectively normal. 

The previous conjecture is known for trees of small maximal degree. Note that X(Ki i3 , Z^) 
is not normal [DBM[ Computation 4.3]. However analogous questions for many other groups 
are open. For the group G = Z2 the reader can consult |DBMj . 

For group-based models the cardinality of the group G equals the dimension of the vector 
space V associated to any node. The dimension of the space associated to the node, for the 
general Markov model determines which secant of the Segre we consider. It is well-known 
(and easy to prove) that there are no equations of degree less or equal to k for the k-th 
secant. 

Conjecture 8.6. Is the k-th secant of the Segre variety defined by equations of degree f{k) 
where f is a linear function? 
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